Speech processing and recognition using artificial intelligence methods

Author

Andrzej Izworski

Abstract

Many problems in the analysis and recognition of sound signals share one trait: finding the proper analysis rule or recognition algorithm is very difficult. In addition, the methods of signal analysis and recognition must be strictly adapted to the specific features of the task at hand. It is worth noting that this situation is essentially different from the one encountered in sound processing. Methods of low- and high-pass filtering, compression techniques, and algorithms of spectral transformation for a sound signal do not actually depend on the type of signal or the purpose for which it is registered and processed. Consequently, enormous progress has been achieved in sound signal processing, and the methods developed are to a high degree universal. The effect is further enhanced by the availability of affordable and convenient DSP techniques. In contrast, in the tasks of sound signal analysis and recognition progress is much slower, and the unification of methods and standardization of algorithms encounter considerable difficulties. The main source of these difficulties is that in almost every signal analysis task the features to be extracted and the required signal parameters are different: they depend strongly on the specific task being solved and are expected to answer different questions. Similarly, in sound signal recognition the criteria and goals of classification can be very different even for the same signal types. Nevertheless, a unified approach in this field is strongly recommended, because it enables more cost-effective and faster development of solutions for particular problems.
It seems that in the analysis and recognition of sound signals, a very promising direction in the search for such unification, and for the universal solutions that might lead to it, is offered by the methods used in artificial intelligence research. There are several definitions of artificial intelligence [1,2], but for the tasks of speech processing and recognition the most appropriate seems to be the slightly narrower concept of computational intelligence [1]. In the field discussed here (speech as a biomedical signal), artificial intelligence methods are mostly employed for: preliminary signal processing and filtering [6]; determination of the space of phono-acoustic features and its visualization [5]; speech recognition, identification of the speaker or of pathological states [3,4,7]; and understanding the signal [8]. In addition to classical methods of pattern recognition, fuzzy systems, and genetic algorithms, neural networks seem to deserve special attention. The neural network ...

MAVEBA 2001: Models and Analysis of Vocal Emissions for Biomedical Applications, 2nd International Workshop, Firenze, Italy, p. 228. ISCA Archive, http://www.isca-speech.org/archive

Similar resources

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have attracted the attention of researchers. Among these, lip-reading techniques have encountered many challenges for speech recognition, and one of the challenges b...

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interaction. The aim of an SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

The Combinational Use Of Knowledge-Based Methods and Morphological Image Processing in Color Image Face Detection

The human facial recognition is the base for all facial processing systems. In this work a basic method is presented for the reduction of detection time in fixed image with different color levels. The proposed method is the simplest approach in face spatial localization, since it doesn’t require the dynamics of images and information of the color of skin in image background. In addition, to do face...

Process Speech Recognition System using Artificial Intelligence Technique

This paper describes the detailed process of speech recognition using an artificial intelligence technique. It includes the acoustic model, language model, trigram model, class model, and source-channel model. Speech recognition or natural language processing refers to artificial intelligence methods of communicating with a computer in a natural language like English. The objective of NLP Progra...

Publication date: 2001
